Data Warehouse Configuration
نویسندگان
چکیده
In the data warehousing approach to the integration of data from multiple information sources, selected information is extracted in advance and stored in a repository. A data warehouse (DW) can therefore be seen as a set of materialized views defined over the sources. When a query is posed, it is evaluated locally, using the materialized views, without accessing the original information sources. The applications using DWs require high query performance. This requirement is in conflict with the need to maintain in the DW updated information. The DW configuration problem is the problem of selecting a set of views to materialize in the DW that answers all the queries of interest while minimizing the total query evaluation and view maintenance cost. In this paper we provide a theoretical framework for this problem in terms of the relational model. We develop a method for dealing with it by formulating it as a state space optimization problem and then solving it using an exhaustive incremental algorithm as well as a *Research supported by the European Commission under the ESPRIT Program LTR project No 22469 “DWQ: Foundations of Data Warehouse Quality” Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment. Proceedings of the 23rd VLDB Conference Athens, Greece, 1997 heuristic one. We extend this method by considering the case where auxiliary views are stored in the DW solely for reducing the view maintenance cost.
منابع مشابه
Automated Data Warehousing for Rule-based CRM Systems
This paper proposes a novel way of automatically developing data warehouse configuration in rule-based CRM systems. Rule-based CRM systems assume that marketing activities are represented as a set of IF-THEN rules. Currently, to provide good quality CRM functionalities, CRM systems seek to combine conventional CRM methodologies with data warehousing technology. A data warehouse can be abstractl...
متن کاملEnhanced Architecture of a Web Warehouse based on Quality Evaluation Framework to Incorporate Quality Aspects in Web Warehouse Creation
In the recent years, it has been observed that World Wide Web (www) became a vast source of information explosion about all areas of interest. Relevant information retrieval is difficult from the web space as there is no universal configuration and organization of the web data. Taking the advantage of data warehouse functionality and integrating it with the web to retrieve relevant data is the ...
متن کاملConfigurative Reference Model-based Development of Data Warehouse Systems
Developing Data Warehouse Systems requires specifications of the underlying business need in the form of information models. The development of information models is often both expensive and extensive. Against this background, reference models provide useful means to reduce the costs of information modelling, because they can be used as a starting point for the construction of project-specific ...
متن کاملAn Ontology-Based Autonomic System for Improving Data Warehouses by Cache Allocation Management
With the increase in the amount and complexity of information, data warehouse performance has become a constant issue, especially for decision support systems. As a consequence, decision experts are faced with the management of all this information, and thus realize that special techniques are required to keep good performances. This paper proposes an approach to data warehouse systems improvem...
متن کاملTracing conceptual models' evolution in data warehouses by using the model driven architecture
Developing a data warehouse is an ongoing task where new requirements are constantly being added. A widely accepted approach for developing data warehouses is the hybrid approach, where requirements and data sources must be accommodated to a reconciliated data warehouse model. During this process, relationships between conceptual elements specified by user requirements and those supplied by the...
متن کاملWarehouse redesign to satisfy tight supply chain management constraints
The picking process is a critical supply chain component for many companies. A proper warehouse configuration, storage policy, trays replenishment policy, and other factors are also important to reduce not only the delivery time, but also increase productivity while maintaining quality factors at competitive costs. This paper presents an integrated approach to tackle the complexity of warehouse...
متن کامل